Viral Proteins Originated De Novo by Overprinting Can Be Identified by Codon Usage: Application to the “Gene Nursery” of Deltaretroviruses
نویسندگان
چکیده
A well-known mechanism through which new protein-coding genes originate is by modification of pre-existing genes, e.g. by duplication or horizontal transfer. In contrast, many viruses generate protein-coding genes de novo, via the overprinting of a new reading frame onto an existing ("ancestral") frame. This mechanism is thought to play an important role in viral pathogenicity, but has been poorly explored, perhaps because identifying the de novo frames is very challenging. Therefore, a new approach to detect them was needed. We assembled a reference set of overlapping genes for which we could reliably determine the ancestral frames, and found that their codon usage was significantly closer to that of the rest of the viral genome than the codon usage of de novo frames. Based on this observation, we designed a method that allowed the identification of de novo frames based on their codon usage with a very good specificity, but intermediate sensitivity. Using our method, we predicted that the Rex gene of deltaretroviruses has originated de novo by overprinting the Tax gene. Intriguingly, several genes in the same genomic region have also originated de novo and encode proteins that regulate the functions of Tax. Such "gene nurseries" may be common in viral genomes. Finally, our results confirm that the genomic GC content is not the only determinant of codon usage in viruses and suggest that a constraint linked to translation must influence codon usage.
منابع مشابه
Evolution of Viral Proteins Originated De Novo by Overprinting
New protein-coding genes can originate either through modification of existing genes or de novo. Recently, the importance of de novo origination has been recognized in eukaryotes, although eukaryotic genes originated de novo are relatively rare and difficult to identify. In contrast, viruses contain many de novo genes, namely those in which an existing gene has been "overprinted" by a new open ...
متن کاملIdentification of Synonymous Codon Usage Bias in the Pseudorabies Virus UL31 Gene
Background: Little knowledge of synonymous codon usage pattern of pseudorabies virus (PRV) genome, especially the UL31 gene in the process for its evolution is available. Objectives: In the present study, the codon usage bias between PRV UL31 sequence and the UL31-like sequences was identified. Materials and Methods: We used a comprehensive analysi...
متن کاملIdentification of an overprinting gene in Merkel cell polyomavirus provides evolutionary insight into the birth of viral genes.
Many viruses use overprinting (alternate reading frame utilization) as a means to increase protein diversity in genomes severely constrained by size. However, the evolutionary steps that facilitate the de novo generation of a novel protein within an ancestral ORF have remained poorly characterized. Here, we describe the identification of an overprinting gene, expressed from an Alternate frame o...
متن کاملEvaluation of Crimean-Congo Hemorrhagic Fever Orthonairovirus AviTagged Nucleoprotein for Potential Application in Diagnosis
Background: Crimean-Congo hemorrhagic fever (CCHF) is an acute viral zoonotic disease, with a mortality rate of 30-50%. There is no approved vaccine or any specific antiviral treatment for CCHF; therefore, the rapid diagnosis seems to be crucial for both efficient supportive therapy and control of infection spread. In this study, the potency of recombinant nucleoprotein of virus expressed in pr...
متن کاملP-22: Codon Optimization of Coagulation Factor IX and Cloning in to The Chinese Hamster Ovary Cells
Background Human coagulation factor IX is a 57kDa plasma serine protease made in Liver which plays a vital role in the blood coagulation cascade. FIX deficiency causes severe disorder Hemophilia B or Christmas disease. Nowadays, recombinant proteins have important roles in treatment of diseases. Although, cultivated mammalian cells because of their ability for producing properly folded protein ...
متن کامل